Search CORE

89 research outputs found

Towards Automatic Capturing of Semi-structured Process Provenance

Author: A. Misra
B. Ludascher
L. Moreau
M. Szomszor
M.D. Allen
T. Oinn
Y. Cui
Y.L. Simmhan
Y.L. Simmhan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Reactive resource provisioning heuristics for dynamic dataflows on cloud infrastructure

Author: Frincu M
Kumbhare AG
Prasanna VK
Simmhan Y
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/04/2015
Field of study

The need for low latency analysis over high-velocity data streams motivates the need for distributed continuous dataflow systems. Contemporary stream processing systems use simple techniques to scale on elastic cloud resources to handle variable data rates. However, application QoS is also impacted by variability in resource performance exhibited by clouds and hence necessitates autonomic methods of provisioning elastic resources to support such applications on cloud infrastructure. We develop the concept of “dynamic dataflows” which utilize alternate tasks as additional control over the dataflow's cost and QoS. Further, we formalize an optimization problem to represent deployment and runtime resource provisioning that allows us to balance the application's QoS, value, and the resource cost. We propose two greedy heuristics, centralized and sharded, based on the variable-sized bin packing algorithm and compare against a Genetic Algorithm (GA) based heuristic that gives a near-optimal solution. A large-scale simulation study, using the linear road benchmark and VM performance traces from the AWS public cloud, shows that while GA-based heuristic provides a better quality schedule, the greedy heuristics are more practical, and can intelligently utilize cloud elasticity to mitigate the effect of variability, both in input data rates and cloud resource performance, to meet the QoS of fast data applications

Crossref

Nottingham Trent Institutional Repository (IRep)

Reproducibility of scientific workflows execution using cloud-aware provenance (ReCAP)

Author: C Scheidegger
E Deelman
EHBM Gronenschild
G Juve
Ilkay Altintas
J Kim
Johannes Starlinger
K Munir
K Munir
Kamran Munir
Kamran Munir
Khawar Hasham
R Sakellariou
T Glatard
W Stallings
Y Simmhan
YL Simmhan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2018
Field of study

© 2018, Springer-Verlag GmbH Austria, part of Springer Nature. Provenance of scientific workflows has been considered a mean to provide workflow reproducibility. However, the provenance approaches adopted so far are not applicable in the context of Cloud because the provenance trace lacks the Cloud information. This paper presents a novel approach that collects the Cloud-aware provenance and represents it as a graph. The workflow execution reproducibility on the Cloud is determined by comparing the workflow provenance at three levels i.e., workflow structure, execution infrastructure and workflow outputs. The experimental evaluation shows that the implemented approach can detect changes in the provenance traces and the outputs produced by the workflow

Crossref

UWE Bristol Research Repository

Document Provenance in the Cloud: Constraints and Challenges

Author: P. Buneman
P.P. Silva Da
S. Davidson
S. Miles
Y. Simmhan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Crossref

Fog paradigm for local energy management systems

Author: A Sheikhi
C Marmaras
C Wei
D Wijayasekara
J Diaz-Montes
J Laustsen
L Pérez-Lombard
N Yaagoubi
RC Green
S Bera
S Maharjan
T Weng
Y Simmhan
ZM Fadlullah
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/10/2017
Field of study

Cloud Computing infrastructures have been extensively deployed to support energy computation within built environments. This has ranged from predicting potential energy demand for a building (or a group of buildings), undertaking heat profile/energy distribution simulations, to understanding the impact of climate and weather on building operation. Cloud computing usage in these scenarios have benefited from resource elasticity, where the number and types of resources can change based on the complexity of the simulation being considered. While there are numerous advantages of using a cloud based energy management system, there are also significant limitations. For instance, many such systems assume that the data has been pre-staged at a cloud platform prior to simulation, and do not take account of data transfer times from the building to the simulation platform. The need for supporting computation at edge resources, which can be hosted within the building itself or shared within a building complex, has become important over recent year. Additionally, network connectivity between the sensing infrastructure within a built environment and a data centre where analysis is to be carried out can be intermittent or may fail. There is therefore also a need to better understand how computation/analysis can be carried out closer to the data capture site to complement analysis that would be undertaken at the data centre. We describe how the Fog computing paradigm can be used to support some of these requirements, extending the capability of a data centre to support energy simulation within built environments

Crossref

Online Research @ Cardiff

mHealth system for the early detection of infectious diseases using biomedical signals

Author: A Adaji
A Uqaili
CL Ventola
E Shakshuki
FF Costa
J Vilaplana
KP High
L Catarinucci
M Chatwin
MK Ong
NM Kumar
P Wu
S Newman
V Kontis
Y Simmhan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/01/2020
Field of study

Latin American Congress on Automation and Robotics LACAR 2019, 30/10/2019-01/11/2019, Cali, Colombia.Detection at an early stage of an infection is a major clinical challenge. An infection that is not diagnosed in time can not only seriously affect the health of the infected patient, but also spread and initiate a contagious approach towards other people. This paper deals with mHealth system for medical care and pre-diagnosis. The developed mHealth system use an Android App that collects physiological signals from the patients with a portable and easy-to-use sensors kit. The focus of the work is put on being able to build a low-cost system that using a very small amounts of data (one set record per patient and day). The processed data are uploaded to an online database to train a clinical decision support system to automatically diagnose infections. The mHealth system may be operated by the same personnel on site not requiring to be medical or computational skilled at all. The implementation takes five kinds of measures simultaneously (Electrodermal Activity, Body Temperature, Blood Pressure, Heart Beat Rate and Oxygen Saturation (SPO2)). A real implementation has been tested and results confirm that the sampling process can be done very fast and steadily Finally, the App usability was tested, showing a fast learning curve and no significant differences are observable in learning time by people with different skills or age. These usability factors are key for the mHealth system success

e_Buah - Biblioteca Digital de la Universidad de Alcalá

Crossref

Determining the Trustworthiness of New Electronic Contracts

Author: A.J.I. Jones
C. Castelfranchi
D. Gambetta
D.W. Aha
G. Governatori
J. Sabater
L. Moreau
L. Torre van der
S. Ruohomaa
Y. Gil
Y. Simmhan
Publication venue: Springer
Publication date: 01/01/2009
Field of study

Expressing contractual agreements electronically potentially allows agents to automatically perform functions surrounding contract use: establish- ment, fulfilment, renegotiation etc. For such automation to be used for real busi- ness concerns, there needs to be a high level of trust in the agent-based system. While there has been much research on simulating trust between agents, there are areas where such trust is harder to establish. In particular, contract proposals may come from parties that an agent has had no prior interaction with and, in competitive business-to-business environments, little reputation information may be available. In human practice, trust in a proposed contract is determined in part from the content of the proposal itself, and the similarity of the content to that of prior contracts, executed to varying degrees of success. In this paper, we argue that such analysis is also appropriate in automated systems, and to provide it we need systems to record salient details of prior contract use and algorithms for as- sessing proposals on their content.We use provenance technology to provide the former and detail algorithms for measuring contract success and similarity for the latter, applying them to an aerospace case study

CiteSeerX

VU Research Portal

Crossref

King's Research Portal

A Core Calculus for Provenance

Author: A. Bohannon
A. Cirillo
C. Dimoulas
J. Cheney
J. Cheney
J. Hidders
L. Moreau
N. Guts
P. Buneman
P. Buneman
R. Bose
U.A. Acar
Y. Simmhan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Provenance is an increasing concern due to the ongoing revolution in sharing and processing scientific data on the Web and in other computer systems. It is proposed that many computer systems will need to become provenanceaware in order to provide satisfactory accountability, reproducibility, and trust for scientific or other high-value data. To date, there is not a consensus concerning appropriate formal models or security properties for provenance. In previous work, we introduced a formal framework for provenance security and proposed formal definitions of properties called disclosure and obfuscation. In this article, we study refined notions of positive and negative disclosure and obfuscation in a concrete setting, that of a general-purpose programing language. Previous models of provenance have focused on special-purpose languages such as workflows and database queries. We consider a higher-order, functional language with sums, products, and recursive types and functions, and equip it with a tracing semantics in which traces themselves can be replayed as computations. We present an annotation-propagation framework that supports many provenance views over traces, including standard forms of provenance studied previously. We investigate some relationships among provenance views and develop some partial solutions to the disclosure and obfuscation problems, including correct algorithms for disclosure and positive obfuscation based on trace slicing.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Edinburgh Research Explorer

MPG.PuRe

Concept and benchmark results for Big Data energy forecasting based on Apache Spark

Author: A McAfee
C Monteiro
H Hassani
H Maaß
J Antonanzas
J Jung
JÁ González Ordiano
JÁ González Ordiano
L Breiman
L Fahrmeir
MJ Duran
R Shyam
RJ Hyndman
S Fan
S Gottwalt
S Klaiber
S Pelland
S Zhou
T Gneiting
T Gneiting
T Hong
V Hagenmeyer
X Fang
Y Simmhan
Publication venue: SpringerOpen
Publication date: 01/03/2018
Field of study

The present article describes a concept for the creation and application of energy forecasting models in a distributed environment. Additionally, a benchmark comparing the time required for the training and application of data-driven forecasting models on a single computer and a computing cluster is presented. This comparison is based on a simulated dataset and both R and Apache Spark are used. Furthermore, the obtained results show certain points in which the utilization of distributed computing based on Spark may be advantageous

Crossref

KITopen

Directory of Open Access Journals

A manifesto for future generation cloud computing: research directions for the next decade

Author: Bahsoon R
Buyya R
Calheiros R
Casale G
De Assuncao MD
De Capitani di Vimercati S
Gelenbe E
Gentzsch W
Javadi B
Jin H
Llorente IM
Milojicic D
Netto MAS
Rana O
Rodriguez MA
Samarati P
Shen H
Simmhan Y
Srirama S
Toosi AN
Vaquero LM
Varela C
Varghese B
Zhou W
Zomaya A
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

The Cloud computing paradigm has revolutionised the computer science horizon during the past decade and has enabled the emergence of computing as the fifth utility. It has captured significant attention of academia, industries, and government bodies. Now, it has emerged as the backbone of modern economy by offering subscription-based services anytime, anywhere following a pay-as-you-go model. This has instigated (1) shorter establishment times for start-ups, (2) creation of scalable global enterprise applications, (3) better cost-to-value associativity for scientific and high performance computing applications, and (4) different invocation/execution models for pervasive and ubiquitous applications. The recent technological developments and paradigms such as serverless computing, software-defined networking, Internet of Things, and processing at network edge are creating new opportunities for Cloud computing. However, they are also posing several new challenges and creating the need for new approaches and research strategies, as well as the re-evaluation of the models that were developed to address issues such as scalability, elasticity, reliability, security, sustainability, and application models. The proposed manifesto addresses them by identifying the major open challenges in Cloud computing, emerging trends, and impact areas. It then offers research directions for the next decade, thus helping in the realisation of Future Generation Cloud Computing

arXiv.org e-Print Archive

Queen's University Belfast Research Portal

Spiral - Imperial College Digital Repository

Western Sydney ResearchDirect

Explore Bristol Research